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Abstract. The /i-limit set of a cellular automaton is a subshift whose forbidden 
O ' patterns are exactly those, whose probabilities tend to zero as time tends to in- 

finity. In this article, for a given subshift in a large class of subshifts, we propose 
the construction of a cellular automaton which realizes this subshift as /z-limit set 
' where ^ is the uniform Bernoulli measure. 
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1. Introduction 



Ch ' A cellular automaton (CA) is a complex system defined by a local rule which acts 

synchronously and uniformly on the configuration space. These simple models have 
a wide variety of different dynamical behaviors. More particularly it is interesting 
to understand its behavior when it goes to infinity. 
^ ■ In the dynamical systems context, it is natural to study the limit set of a cellular 

, automaton, it is defined as the set of configurations that can appear arbitrarily far 

in time. This set captures the longterm behavior of the CA and has been widely 
studied since the end of the 1980s. Given a cellular automaton, it is difficult to 
^ ■ determine its limit set. Indeed it is undecidable to know if it contains only one 

, configuration |Kar92| and more generally, every nontrivial property of limit sets 



is undecidable |Kar94] . Another problem is to characterize which subshift can be 
obtained as limit set of a cellular automaton. This was first studied in detail by 
Lyman Hurd |Hur87| . and significant progress have been made |Maa95| IFK07] but 



, , there is still no characterization. The notion of limit set can be refined if we consider 

\ the notion of attractor |Hur90a| IKur03] . 

However, these topological notions do not correspond to the empirical point of 
view where the initial configuration is chosen randomly, that is to say chosen accord- 
ing a measure /i. That's why the notion of /x-attractor is introduced by |Hur90b] . 
Like it is discussed in |KMOO) with a lot of examples, this notion is not satisfactory 
empirically and the authors introduce the notion of /i-limit set. A /i-limit set is 
a subshift whose forbidden patterns are exactly those, whose probabilities tend to 
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zero as time tends to infinity. This set corresponds to the configurations which are 
observed when a random configuration is iterated. 

As for hmit sets, it is difficult to determine the /x-hmit set of a given cellular 
automaton, indeed it is already undecidable to know if it contains only one configu- 
ration |BPT06) . However, in the literature, all /i-limit sets which can be found are 
very simple (transitive subshifts of finite type). In this article, for every recursively 
enumerable family (Sj)jgN of subshifts generated by a generic configuration, we con- 
struct a cellular automaton which realizes IJieN ^« /i-limit set. In particular all 
transitive sofic subshifts can be realized. It makes a strong difference with limit sets 
since there are sofic subshifts, as the even subshift (subshift on alphabet {0, 1} in 
which all words 01*^0 with odd k are forbidden), which cannot be realized as limit 
set |Maa95] . 

To construct a cellular automaton that realizes a given subshift as /x-limit set, 
we first erase nearly all the information contained in a random configuration thanks 
to counters (section [3]) . Then we produce segments, which are finite areas of com- 
putation. On each segment we construct small parts of the generic configurations of 
many subshifts, and as time passes, segments grow larger and every word of every 
subshift appears often enough (section |5]). 

2. Definitions 

2.1. Words and density 

For a finite set Q called an alphabet, denote Q* = U„gj^ the set of all finite 
words over Q. The length oi u = UqUi . . . Un-i is \u\ = n. We denote the set of 
configurations over Q, which are mappings from Z to Q, and for c G Q"^, we denote 
Cz the image of z G Z by c. For u E Q* and < z < j < |n| — 1 we define the suhword 
M[j j] = UjUj+i . . .Uj] this definition can be extended to a configuration c G as 
C[ij] = CiCi^i ■ ■ - Cj for j G Z with i < j. The language of S G is defined by 

'^('S') = G Q* : 3c G S*, 3z G Z such that u = C[j_j+|u|_i]}. 

For every u E Q* and z G Z, we define the cylinder [u]i as the set of configurations 
containing the word u in position i that is to say [u]i = {c G : C[i^i^\u\-i] = u}- If 
the cylinder is at the position 0, we just denote it by [u]. 

For all u,v E Q* define the number of occurences of in n as: 

\u\y = card{? G [0, \u\ - \v\] : U[i^i+\^\^i] = v} 
For any two words n, v E Q*, let du{v) = ^jpj^- 

For a configuration c G Q^, the density dc{v) of a finite word v is: 

dc{v) = limsup4[_„,„j(^')- 

These definitions could be generalized, for a set of words W <Z Q*, we note \u\w 
and dc{W). 

Definition 2.1 (Normal configuration). A configuration is said to be normal for 
an alphabet Q if all words of length n have the same density of apparition in the 
configuration. 
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2.2. Subshifts 

We denote by a the shift map cr : ^ Q'^ defined by cr(c)j = Cj_i. A subshift 
is a closed, o"- invariant subset of Q^. It is well known that a subshift is completely 
described by its language denoted C(T,). Moreover, it is possible to define a subshift 
by a set of its forbidden words which do not appear in the language. 

As the shift invariance is preserved, intersections and closures of unions of sub- 
shifts are still subshifts. And in particular, the union of a set (£(Sj))j of languages 
describes the subshift that is the closure of the union of all subshifts: IJjgj^ Sj. 

We define some classes of subshifts. A sofic subshift is a subshift whose language 
of forbidden words is rational, i.e. given by a finite automaton. A subshift S is 
transitive if for all u,v & ^C^) there exists a word w such that uwv G jC.{T,). Let 
s : Q — >■ Q* be a primitive substitution (there exists /c G N such that for all a,b & Q 
a appears in s^(6)), the substitutive subshift associated to s is the subshift such 
that 

= {u E Q* : 3a & Q and n E N such that u appears in s"'(a)}. 

2.3. Cellular automata 

Definition 2.2 (Cellular automaton). A cellular automaton (CA) ^ is a triple 
{Qaj ^a) where is a finite set of states called the alphabet, is the radius of 
the automaton, and : Q^"^^^ ^ Qa is the local rule. 

The configurations of a cellular automaton are the configurations over Qj(. A 
global behavior is induced and we'll note A{c) the image of a configuration c given 
by: Wz G Z, A{c)z = d^^Cz-rj^, • • • , c^, . . . , c^+r^). Studying the dynamic of A is 
studying the iterations of a configuration by the map A : Q\ Q\. When there is 
no ambiguity, we'll note Q, r and 5 instead of Q^, and 5^. 

2.4. ;U-limit sets 

Definition 2.3 (Uniform Bernoulli measure). For an alphabet Q, the uniform 

Bernoulli measure fi on configurations over Q is defined by: Vm G Q*, i G Z, fi{[u]i) = 
1 

|Q|k|- 

For a CA ^ = {Q,r,6) and u G Q*, we denote for all n G N, A^ij.{[u]) = 

/i(^-(M)). 

Definition 2.4 (Persistent set). For a CA A, and the uniform Bernoulli measure 
/i, we define the persistent set L^{A) with: Wu E Q*: 

u ^ LJA) ^ lim ^"Ai(Mo) = 0. 
Then the fi-limzt set of ^ is A^(^) = {c G : L(c) C Lf,{A)}. 

Remark 2.5. As this definition gives a set of forbidden finite words, we clearly see 
that /i-limit sets are subshifts. 

Definition 2.6 (Set of predecessors). We define the set of predecessors at time n of 
a finite word m for a CA ^ = {Q, r, 6) as P^{u) = {v E gl«l+2™ ; A"i[v]-rn) C [u]o}. 
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Remark 2.7. As we consider the uniform Bernoulli measure /i, |Q||„|+2rn — >• -v^ 
u i L^{A). 

Remark 2.8. The set of normal configurations has measure 1 in Q'^. Which means 
that a configuration that is randomly generated according to measure /i is a normal 
configuration. 

Lemma 2.9. Given a CA A and a finite word u, with /i the uniform Bernoulli 
measure, for any normal configuration c: 
u G A^(^) ^ rfyi"(c)(^^) ^ when n — +oo. 

Proof. Let n G N, r be the radius of A and /i the uniform measure. We prove here 
that: c/^n(c)(M) = ^Xm) = 

The second part of the equality is obtained by definition of A"'^{u). We focus on 
the first part. Since any occurence of u in A^{c) corresponds to an occurence of a 
predecessor of n in c : 



, t \ V |^"(c)[_fc,fc]|„ 

(c) [u) = lim sup _ , , = lim sup 





C[— fc— rn,fc+rn] 
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And as c is normal, for any v G P^{u) : \c[^k~rn,k+rn]\v ~fe^+oo iqIhW- 
Then: 

, . N V- 1- /^^^^_ 2^+1 \ _ V- 1 _ IP^HI 

d^"(c)(«j- 2^ ^™^^Pl^2A;+l-|n||Q|H+2rnJ - \Q\\u\+2rn " |g||«|+2.n- 

■ 

Proposition 2.10. Let u G L^j_{A), there exists a word w such that uwu G L^{A). 

Proof. Let u G L^(^), there exists a > and an increasing sequence (nj)jgN such 
that .A"*/x([m]) > a. Thus, for a normal configuration c, one has dj\j^i(^f.){u) > a for 
all 2 G N. Let / G N and e > such that t^tttt < a — e, we define 

= {w G • is not a subword of w and |w| < /} and 
W2 = {w G Q*j^ : n is not a subword of w; and \w\ > 1} . 
Consider uW^u = {uwu : w G Wi} for k G {1, 2}, one has 

dA"^{c)[uW2U) = hmsup < 

since a word of UW2U can appear at most 2|n| times for each pattern of length 2|m| + Z 
of^"'(c). Moreover (i^ni(c)(MH^iM)+(i^"i(c)(MW^2w) > (i^"i(c)(M) so (i^ni(c)(MiyiM) > e. 

Since VTi is finite, there exists a word w G H^i such that djinif^c){uwu) > e for an 
infinity of i G N. Thus uwu G L^(^). h 

Example 2.11. We consider here the "max" automaton Am- The alphabet contains 
only two states and L The radius is 1. When the rule applies to three (no 1), 
it produces a 0. In any other case, it produces a 1. 

The probability to have a at time t is the probability to have 0^*"''^ on the initial 
configuration. Which tends to when t — 00 for the uniform Bernoulli measure. 
So, does not appear in the /i-limit set. And finally A^(^j\,/) = 

And this example gives a difference between subshifts that can be realised as 
limit set (A(^) = C\i£N iQ^)) subshifts that can be realised as /i-limit set. 
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Effectively, A{Am) = (°°10*1°°) U(°°0°°) U(°°01°°) U(°°10°°), but if we apply propo- 
sition [2110] with the word 01, we conclude that A{Am) cannot be a /i limit set. 

3. Counters 

In this section and the following one, we describe an automaton As, which, on 
normal configurations, produces finite segments of size growing with time. In these 
segments, we will make computations described in section [5l 

Before starting the computation, the automaton As has a transitory regime 
which erases the random configuration and generate segments between ^ where 
the computation is done. To do that, we have a special state *, that can only 
appear in the initial configuration, and which generates two counters. Between two 
counters, the states are initialized and when two counters intersect, they compare 
their respective age. If they do not have the same age, the younger deletes the older 
one; if they have the same age, they disappear and we put the state ^ in order 
to start the computation. The notion of counters was introduced in |DPST10] to 
produce equicontinuous points according to arbitrary curves. 

We recall some ideas which allow to construct such automaton: 

• no transition rule produces the state *; 

• * produces two couples of signals, one toward the left and another one toward 
the right; 

• a couple of signal (called counter) is formed by an inner signal and an outer 
signal, which is faster. Their collisions are handled in the following way: 

— nothing other than an outer signal can go through another outer signal; 

— when two outer signals collide they move through each other and com- 
parison signals are generated; 

— on each side, a signal moves at maximal speed towards the inner border 
of the counter, bounces on it and goes back to the point of collision; 

— the first signal to come back is the one from the youngest counter and 
it then moves back to the outer side of the oldest counter and deletes it; 

— the comparison signal from the older counter that arrives afterwards is 
deleted and will not delete the younger counter's outer border; 

• between a left counter and a right counter, the configuration is initialized; 

• if two counters that have the same age meet, they disappear and produce 
the state 7^5 which start the computation described in section [5] 

• the state 7^5 becomes # which delimitates segments, this state can disappear 
if two adjacent segments decide to merge as described in section HJ or if a 
counter (necessarily younger) encounters it. 

The initialization of a configuration is illustrated in figure [1] The gray areas 
of computation begin on the left of a # produced by the meeting of two counters 
generated by a *. 

Lemma 3.1. There exists a constant K^. such that if two # are distant of k, they 
appeared before time k x K^. 

Proof. Consider two states # in the space time diagram separated by k cells. If 
# is not in the initial configuration, the only way to appear is to result from the 
collision of two counters coming from the left and from the right. Thus, in the initial 
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Figure 1: When two counters launched by a * meet, a # is produced and a com- 
putation is launched on the right. The computation area extends until it 
meets the inner signal of a counter or another 

configuration, it is necessary to have the state * between the two # to create the two 
This operation take at most k x where is the speed of an inner signal. ■ 

4. Merging segments 

We saw in Section [3l how a special state * on the initial configuration gave birth 
to counters protecting everything inside them until they meet some other counter 
born the same way. In this section, we will describe the evolution of the automaton 
As after this time of initialization. When two counters of the same age meet, they 
disappear and a # is produced. 

Definition 4.1 (Segment). A segment u is a subword of a configuration delimited 
by two # and containing no # inside. So, u G The size of a 

segment is the number of cells between both ^. 

There will be computations made inside segments, but we will describe it later. 
Thus, in a segment, there is a layer left for computations that remain inside the both 

and a "merging layer" that will contain signals necessary to the behavior with 
other segments. Every signal presented in this section will travel on this merging 
layer. The idea is the following: at some times, two neighbor segments will decide 
to merge together to form one single segment whose size will be the sum of both 
sizes plus one. And we will assure that each segment will eventually merge, so that 
no segment of finite size can still be in the /i-limit set of As- 

When a 7^ is produced in automaton As, it sends two signals, on its right and on 
its left to detect the first ^ on each side. If the signal catches the inside of a counter 
still in activity before reaching a it waits until the counter produces a Then 
both # have recognised each other and the segment between them is "conscious". It 
launches a computation inside it, and waits until it is achieved. We will assure later 
that this computation ends. When this is done, it will alternatively send signals to 
its left and to its right in order to propose successively to each neighbor to merge. 

For this purpose, it computes and stores the length n of the segment as a binary 
representation. Then the segment puts a L mark on its left ^, and waits for 
timesteps. If, during this time, the left side neighbor has not put a R mark on the 
common our segment erases the L mark, a signal is sent on the other side, and 
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i; i; 

### ## ### # 

Figure 2: A ^ stays until two segments merge. Computation happens in gray areas, 
and at its end a signal (...) is sent and stays on the left of the segment, 
then goes to the right, stays and comes back. This cycle continues until 
a neighbor's signal is on the same ^ at the same time. Then the # is 
deleted and another computation is started on the left. 

it puts a R mark on its right ^. It waits once again timesteps before erasing the 
R, sending a signal to its left, and starting over. The whole cycle takes 2{n'^ + n) 
timesteps as we consider a signal at speed 1 crossing a segment of size n. We request 
the signal to stay timesteps because as (n + 1)^ > + 2n, if two segments do not 
have the same size, their signals eventually meet during a cycle of the smallest one. 
So, the only case in which two neighbor segments that try to merge do not merge, 
is when they have same size and are correctly synchronized. Computing and storing 
n, and waiting can be done with a space log(?7,). 

This process ends when at the same time, both a L and a R mark are written 
on a When this happens, the two segments agree to merge together and they do 
it: the 7^ between them is erased, and the whole activity begins again, starting with 
the computation inside the new segment. 

The general behavior of the segments among themselves is illustrated in figure [21 
We prove the following claims for automaton As- 

Claim 4.2. For any two words u,v G Q* , with \u\ ^ \v\, if the word w = ^u^v^ 
appears at time t in a space-time diagram of As, one of the 3 # ofw has disappeared 
at time t + IQll""! + 2{\w\^ + \w\). 

Proof. If the word w exists at time t on a space time diagram, at time t + 
(respectively t + at most, the computation is achieved in u (resp. v). We 

suppose here that no 7^ in u; has disappeared, which means, u and v do not merge 
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with any other segment outside w. So at time t + IQI''"' both segments try to merge 
with another one. Assume \v\ > \u\ for example, the other case is totally symmetric. 
Then, as |f p > |mP + 2|m|, before the end of the cycle of v, they have put their mark 
simultaneously on their common 7^ for one timestep at least. And consequently, 
they have merged and one # has disappeared at time t + IQI'""' + 2(|wp + \w\). m 

Claim 4.3. // two segments of size less than k E N merge together, they do it at 
most IQl"^^ + 2{{2ky + 2k) timesteps after being formed. 

Proof. If they don't have the same size, lemma let us conclude. If they have the 
same size, their computations are achieved after IQP'^- And as their merging cycle 
takes the same time for both, if they do not merge during the first cycle, they will 
never merge. So if they merge, they do it before IQl"^^ + 2((2A;)^ + 2k). m 

Claim 4.4. For any two words u,v E Q* , with \u\ 7^ \v\, the word w = jj^ujj^vjj^ 
does not appear in K^{As)- 

Proof. We use the constant from lemma [XT] Denote 

T =\w\x Kc + \w\ {\Q\\'"\ + 2(|w|2 + \w\)). We prove that for t > T, P\^{w) = 0. 
If the two ^ encircling w never disappear, the dynamic inside w is not affected 
by the exterior. Through time, some other # possibly appeared and disappeared 
between them. But after time at most \w\ x Kc, they have all appeared. Since then, 
they will only disappear. There are less than |w| — 1 excedentary # that have to 
disappear. Considering lemma 14. 3[ one disappears at least every IQI'^"' +2(|wp+ 1^1) 
timesteps. After that, the two segments of w are formed, and with lemma l42| one 
of the ^ oi w disappear before IQI'""' + 2(|wp + \w\) new timesteps. Finally, at time 
T, one of the ^ oi w has disappeared and P^^{w) = 0. h 

Proposition 4.5. There is no # in the fi-limit set of As- 

Proof. Assume that # G L^(^), by Proposition I2.10[ there exits u E Q* such that 
G L^{A), we can assume that u does not contain Let k = \u\, by Lemma IXTl 
the ^ encircling u appeared before time kxKc- Denote W = : v G (Q\{#})^} 

and Xn = {x e : ^''(x)[o,fc+i] G W for all k e [k x Kc,n]}. Since #m# G Lf,{A), 
there exists a > such that fi{Xn) > a for an infinity of n G N. Moreover, as 
Xn+i C Xn, we can conclude that /i(Xoo) > a where X^o = Hneoo^N- 

As /i is Bernoulli, we have niY) > where Y = [*{Q \ *)^'^ * (Q \ *)^'^*]o; 
moreover there exist ki > and k2 > ki + Ak + 1 such that niZ) > where 
Z = X^n cT-^i(y) n a^''^{X^). For all n > k X one has F'^(Z)[o,fc+i] C W, 
F''(Z)[fc2,fc2+fc+i] C W and F'^{Z)[k^+kM+3k] C F'^{Y)[k^+kM+3k] does not contain #. 

We deduce that there exists a word w E Q oi length k2 — k — 1 such that 
w^[A;i+fc,fci+3fc] does not contain # and G Lf^{A). However, in ^u^w^^u^ 

we can find two segments #ni#U2# which have different length. By Claim U3] we 
obtain a contradiction. Thus, there is no 7^ in the /i-limit set of As- ■ 

Finally, we prove a lemma that will be useful later. 

Claim 4.6. The density of cells outside segments generated by counters born in the 
initial configuration tends to 0. 



Proof. The proof is clear since such a cell needs predecessors without states * on 
each side in the initial configuration. ■ 
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Lemma 4.7. Let u ^ Q* . If\/k,\/l > k, for any segment v G , dy{u) < with 
ftfe — when k — > oo, then u ^ L^{As)- 

Conversely, ifWk,Wl > k, for any segment v G Q'' , d^iu) > au with ^ when 
k oo, then u G Lfj_{As)- 

Proof. Let's consider a normal configuration c. For any G N, we denote 

4 = ^ X dA%{c){v) 

ve#{QAy#, i<k 

the density of cells in segments of size less than k in the image at time t of c. Due 
to proposition 14. 5^ — when t — )■ oo. And due to claim 14. 6^ the density a* of 
cells outside wellformed segments tends to when t — oo. 

Suppose VA;,V/ > A;,Vf G segment, d^i^u) < and — > 0. Any occurence 
of u is either in a segment of size less than A;, either in a segment of size greater than 
/c, or out of segments. Finally, at a given time t, (i_4t^(c)(u) < d\ + + a*. 

As this equation holds for any fc, finally, when t — )■ oo, (i^t^(c)(u) has a limit 
which is 0. This concludes the proof of the first part of the lemma with lemma [2^ 

In the other side, suppose VA;,V/ > k,^v E segment, dv{u) > and ^ 0. 
Therefore, d_^t^(^c){u) > (1 — — a*)afc which does not tend to when t — )■ oo and 
k ^ oo. Thus, u G ^^(^5). h 



5. Infinite Unions 

In this section we will see how to create a CA whose /i-limit set is the closure 
of the infinite union of a recursively enumerable family of particular subshifts. 

Definition 5.1 (Generable Subshift). We say that a Turing machine M generates a 
subshift S C if M computes a generic configuration of S in the following sense: 

• the tape alphabet of M contains Q; 

• on an empty tape, M writes the right half of a configuration c G S such 

that limsup„_j.j^ > if and only if u G c is called a generic 

configuration; 

• after a symbol of Q has been written on the tape, it is never changed. 

Theorem 5.2. Given a recursively enumerable family (Sj)jgN of generable subshifts, 
that is to say that there exists a Turing machine that enumerates a set of machines 
{Mi)i G N such that Mi generates the subshift Sj, there exists a cellular automaton 
A whose fi-limit set is exactly the subshift (J.^j^Sj. 

Proof. Let us consider a recursively enumerable family (Si)jgN of generable subshifts, 
let us denote by M the Turing machine that enumerates the machines (Mj)jgN such 
that Mi generates the subshift Sj. 

We now describe the behavior of such a cellular automaton A. A will work as 
the automaton As described in Section HJ starting from a normal configuration, it 
will generate "counter signals" that will produce finite segments on the configuration 
(separated by a # symbol). We now describe the computation performed by each 
finite segment during the evolution of the cellular automaton. 

The first thing a segment does is compute its length n and store it as a binary 
number. By incrementing a binary counter moving across the segment, this is easily 
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done in space log(n). Once this is done, the segment can simulate Turing machines 
on its first log(n) cells (it is important to limit the computational space so that the 
computation states become negligible and disappear from the /x-limit set). 

On the initial log(n) cells of the segment the machine M is simulated to produce 
the descriptions of the first k machines {Mi)i^k, with k as big as possible for M 
computing on a tape of length log{n). And we also request that k < log(log(?7,)). k 
may be for short segments, but we know that as the segments grow larger, k will 
grow too. 

The space of size log(?7,) is further divided into k fragments of size \og{n)/k. On 
the z-th fragment, the corresponding machine Mj is simulated to produce the word 
Wi beeing the begining of the generic configuration corresponding to the subshift Ej. 
The word Wi might be much smaller than log(n)/A; depending on the space needed 
by the machine Mi to compute, but again we know that as segments grow larger, 
larger words will be computed. 

After the k different Wi have been computed, the initial segment of length n is 
split into ^Jn fragments of length ^/n. Each of these fragments is filled with copies 
of one of the Wi in the following manner: one out of two is filled with Wi, one out of 
four (i.e. one out of two among the remaining fragments) is filled with W2i one out 
of eight is filled with W3 and so on. The remaining segments (if k is very small, we 
might run out of Wi before filling all the fragments) are filled with Wk- Fragments 
are separated by a symbol %i ^ Q and the copies of words Wi inside a given fragment 
are separated by a symbol $2 ^ Q. 

Remark 5.3. The previous construction can be done using only log(n) cells of 
computation at each step (cells that are not active and that only contain a symbol 
from Q U {$1, $2} are not counted). To fill the fragments of size ^/n we only need to 
compute the binary expression of y/n and then advance through the segment while 
filling the fragment with the appropriate Wi while decreasing a counter to measure 
^/n cells. The important data (the words Wi and different counters) are moved 
through the segment so that they are always present near the location to be filled. 
Thus the head of the Turing machine M carries only log(n) cells used to store the 
Wi and to its computation. No mark of the computation remains in the other cells, 
even those already visited and rewritten. 

When all the fragments of the segment have been filled with the Wj, the segment 
can erase all the remaining computation data and start the process of merging with 
its neighbors as described in Section SI 

When two segments merge, the whole computation is restarted but this time 
with a larger space. The segments are not erased immediately after a merge, but 
rather the new data overwrites the previous as the ^/n fragments are filled. 

We will prove that Lf^{A) = [j-^^ C{T.i). 

Claim 5.4. The states used for computation, signals inside segments, writing frag- 
ments, $1 or $2 do not appear in A^(^). 

Proof. Here we use the lemma 14.71 for each of these states. 

We use the log(fc) initial cells of a segment of size k to do the computation, so 
the density of these cells is log(A;)/A;, and the property is proved. The head of the 
Turing machine M carries at most log{k) cells for its computation or writing, thus 
the same argument works. The signals for the merging process are in a finite number 
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in a segment, therefore their density in a segment tends to too. The density of $i 
is y/k/k, and the lemma apphes once again. 

For the density of $2, let A > 0, 3A;o > such that the word Wi produced in 
a segment of size k > ko is such that \'Wi\ > A for any i < X. So, for k > ko, the 
density of $2 in a segment of size k is less than 1/A in fragments of Si,i < X and 
less than 1 in the other fragments that have themselves a density lower than 1/2'*'. 
And thus, the density of $2 is lower than ^ + ^ in segments of size k > ko. Finally 
the density of $2 tends to when k ^ 00. And the claim is proved. h 

Claim 5.5. For any subshift Sj, z G N, any word u G and any family of 

segments {vk)k of size \vk\ = k, d^i^{u) does not tend to when k — )■ 00. 

Proof. As M G C{T,i), its density a{u) in the generic configuration computed by Mj is 
positive. So, there exists /j G N such that any subword of this configuration contains 
u with density at least a{u)/2. Let ko such that in any segment of size k > ko, the 
word Wi computed has length \wi\ > U. 

For any segment Vk of size k > ko, there are log(A;) cells occupied for com- 
putation, less than \/{k) cells containing a $1 and ^ifj among the remaining cells 
attributed to the copies of Wi. Among these copies, a proportion of the cells 
contain $2. log(/c) additional cells can be dedicated to the head of the Turing ma- 
chine M writing in the segment and a finite number K of cells can contain signals 
for the merging process. Finally, 

( ( k-hgk)^^/lk)\ k - 1 
dv,{u) > I I ^,+1 1 — log(fc) - 

Which does not tend to when k 00. 

Claim 5.6. For any subshift Sj, z G N and any word u G CiT^i), u G L^(^). 
Proof. We clearly get the result by combining claim 15.51 and lemma 14.71 




Finally, the theorem is proven: 

• the proposition 14.51 and the claim 15.41 assure that every state used for com- 
putation does not appear in A^(^), which means L^{A) C [J^gj^£(Si), 

• the claim [5^ assures that [J.gj5^£(Sj) C L^{A). ■ 

The next proposition gives some examples of generable subshifts. 

Proposition 5.7. The following subshifts are generable: 

• transitive sofic subshifts, 

• substitutive subshift associated to a primitive substitution. 

Proof. As a transitive sofic subshift S is given by the strongly connected automaton 
recognizing its language. For example, we can write successively every cycle of size 
k for k from 1 to 00. In this case we obtain a configuration where the density of all 
the words of the language of S is positive. 

For a primitive substitution s, it is easy to generate the fix point configuration 
denoted C[o;oo] whose all prefixes are given by s''{a) for all /c G N where a E Q. It 
is well know that all words of the substitutive subshift associated appears with a 
positive density in C[o;oo] |Fog05 . ■ 
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6. Conclusion and perspectives 

In this paper, we prove that a large class of subshifts can be realized as /i-limit 
sets of cellular automata. In particular, it is possible to obtain all transitive sofic 
subshifts, this is a profound difference with the topological case since the even shift 
cannot be realized as the limit set of one cellular automaton. This construction 
allows to control the iterations of a random configuration in view to obtain an auto- 
organized behavior. The construction can be adapted at least in two ways: 

• to obtain the same result for a large class of measure (a-ergodic measure of 
full support) modulo some technical changes 

• to obtain a subshift without any word of low complexity (as suggested by V. 
Poupet). 

Of course the main open question is in the reciprocal of the theorem, that is to 
say to characterize subshifts that can possibly be realized as /i-limit sets. 
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